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Cross^eferen^^ 

ffled October 13, 2000, from which application priority is claimed under 35 USC 

Hied ucio unrated herein by reference in its entirety. 

§ 1 19(e)(1) and which application is incorporated herein oy 

I§ ^ mM . , onrelates gener ally to recombinant gene expression systems. 
The present invention relates generally w 

fragments for use in expression constructs for expressing ge P 
using the same. 

RacksroundOrTheJnyentwn 

mMSL —~: a — i^prodncedinavarie^ofprocaryoHcandenc^ 

vectors are used for purposes ranging from e.g., preliminary evaluate f the ^ 

H fo, example for gene therapy and nucleic acid vaccmanon. 
vectors are used, for example, g ^ ^ ^ 

In general, effective vectors are those that express nrgn 

plasmid from endocydc vesicles after endoeytosis ; (3) translocate of *e pla^ntd 
tcyt„plasm,n,o, h enuc,eus ; a» d (4) tt anscri P Uonofmep,asrnid,n.he„uc,eus. 
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Work from several laboratories suggests that a major barrier to efficient . 
transfection is translocation of the plasmid into the nucleus, particularly in cells that do 
no, undergo mitosis (e.g., myocytes). One parameter that may affect this step is the s,ze 
of the plasmid, as the nuclear pore complex involved in uptake of macromolecules .nto 
the nucleus has a finite size. Hence, it is desirable to engineer small ptemids mat retatn 
the ability to express proteins a. high levels. This has the potential to facilitate DNA 
delivery and allows the insertion of larger gene inserts man is feasible in larger plasmtds. 
The latter point is particularly important for preparation of certain recombmant vtral 
vectors that have a limited capacity to package plasmids, such as alphavirus and adeno- 
associated vectors. 

One particularly effective system for the production of recombinant protems 
employs vectors conuuning the human cytomegalovirus (hCMV) immediate-early (El) 
ermancer/promoter region which controls transcription of the immediate-early 72,000 
molecular weight protein of hCMV. See, e.g., Chapman e. al., Nuc. Acids Res. (1991) 
19-3979-3986- and U.S. Patent No. 5,688,688. The hCMV IE1 enhancer/promoter ts one 
of the strongest enhancer/promoters known and is active in a broad range of cell types. 

The hCMV IE1 enhancer/promoter region (Figure 2) includes a tissue-specfic 
modulator, multiple potential binding sites for several different transcription factors, and 
a complex enhancer. The transcribed region of the gene contains four exons and three 
introns. The largest of the introns, termed "Intron A," is found within the 5'-untnansla,ed 
region of the gene. See, e.g., Chapman e, a,., Nuc. Acids Res. (1991) 12:3979-3986 for 
the sequence and structure of this region in hCMV strain Towne, and Akrigg et a.., V,rus 
Res (1985) 2:107-121, for a description of the corresponding region m hCMV strain 
AD169 The Intron A region of the hCMV ffil enhancer/promoter has been shown to 
contain e.ements that enhance expression of heterologous proteins in mammalian ells. 
See e g., Chapman et al., Nuc. Acids Res. (1991) 19:3979-3986. 

totrons are non-coding regions present in most pre-mRNA transcripts produced m 
the mammalian cell nucleus. Intron sequences can profoundly enhance gene express™ 
when included in heterologous expression vectors. See, e.g., Buchman et a.., Molec. Cell 
3 Biol (1988) 8:4395-4405; Chapman et a.., M* Acids Res. (1991) 19:3979-3986. Recent 
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nucleus of mature mRNAs to ,he cytoplasm. Culler, B.R., Proc. ML Acad. Sc. USA 
(2000) 97:4-6; and Luo et a.., Proc. Nail. Acai. Sci. USA (1999) 96,14937-14942. 
Accordingly, increased levels of expression, such as those seen with the Intron A reg.cn 
of the hCMV IE1 enhancer/promoter, may be due to increased levels of translatable 
mRNAs in the cytoplasm. 
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Summary of the Invention 

Accordingly, the present invention provides CMV Intron A fragments for use m 
expression constructs. The fragments retain the ability to enhance expression levels when 
present in such expression constructs. The use of Intron A fragments is destrab.e, 
especially when used in recombinant viral vectors with size constraints for packagmg 
plasmids, such as alphavirus and adeno-associated vectors. Thus, the present invent™ 

in therapeutically useful quantities, both in vitro and in vivo. 

Accordingly, in one embodiment, the subject invention is directed to an hCMV 
Intron A fragment, wherein the fragment lacks the full-length Intron A sequence and 
comprises : (a) a sequence of nucleotides having at leas, about 75% sequence identity to 
the contiguous sequence of nucleotides found at positions 1-25, inclusive, of F.gure 1A, 
and (b) a sequence of nucleotides having a. least about 75% sequence identity to the 
contiguous sequence of nucleotides found a. positions 775-820, inclusive, of F.gure 1A. 
Further, when the fragment is present in an expression construe the expression construct 
achieves expression levels greater than those levels achieved by a correspond.^ 
construct that completely lacks an Intron A sequence. In certain embod.ments, the 
expression levels achieved are a. least two-fold, or at leas, ten-fold, or at leas, fifty-fold 
greater than those levels achieved by a corresponding construct that completely lacks an 
Intron A sequence. 

In another embodiment, the invention is directed to an Intron A fragment mat 
comprises: (a) a sequence of nucleotides having at leas, about 75% sequence identity to 
the contiguous sequence of nucleotides found at positions 1-51, inclusive, of F.gure 1A, 
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and <b) a sequence of nuclides having a, .eas, abou, 75% sequence identity to .he 
contiguous sequence of nuclides found atpositions 74.-820, inclusive, of Frgure 1 A, 
wherein when the fragment is present in an expression construe,, the expression construe, 
achieves expression levels greater than .hose levels achieved by a corresponding 
construct, completely lacks an Intron A sequence. In certain embodiments *e 
expression levels achieved are a, leas, Uvo-fold, or a, .eas, ten-fold, or a, .eas, fifty-fo.d 
greater man tirose >eve!s achieved by a corresponding construe, tha, completely lacks an 

Intron A sequence. 

In another embodiment, the Intron A fragment comprises the sequence of 
nucleotides 1-51, inclusive, of Hgure 1 A, linked to nucleotides 741-820, inclusive, of 
Figure 1A. 

In still a further embodiment the Inlron A fragment comprises me Intron A 
nucleotide sequence depicted in Figure 1C, or a nucleotide sequence with a, least abou, 
75% sequence identity thereto. 

ta anomer embodiment me Intion A fragmen, consists of me Intion A nucleotide 

sequence depicted in Figure 1C. 

In yet anomer embodiment me invention is directed to an hCMV Intron A 
fragmen,, wherein the fragmen, lacks me full-lengm Intion A sequence and compnses: 

(a) a sequence of nucleotides having at least abou, 75% sequence identity to me 
contiguous sequenceof nucleotides found a, positions 1-25, inclusive, of Figure 1A, atrd 

(b) a sequence of nucleotides having a, leas, about 75% sequence identity to me 
contiguous sequence of nucleotides found a, positions 775-820, inclusive, of Figure 1 A, 
wherein when me fragmen. is present in an expression constiuc,, «he expression construe, 
achieves expression levels equal ,o, or greater ,han, mose levels achieved by an 
expressionconsti™,ma,includesa™^ 

,„ anomer embodiment, me invention is directed .o an hCMV Intion A fragment 
wherein the fragmen, lacks me full-length Intron A sequence and comprises: (a) a 
sequence of nucleotides having a, least about 75% sequence identity to me contiguous 
sequence of nucleotides found at positions 1-51, inclusive, of Figure 1 A, and (b) a 
sequence of nucleotides having at leas, about 75% sequence identity ,o tire contiguous 
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sequence of nucleotides found a. positions 74,-820, inclusive, of Figure ,A, wherein 
when the fragment is present in an expression construct, the expression construe, 
achieves expression levels equal to, or greater than, those levels achieved by an 
expression construe, that includes a corresponding intact, full-length Mron A sequence. 

to further embodiment me invention is directed ,o recombinant expresston 
constructs comprising (a) a coding sequence; and (b, control elements tha, are operably 
iinked.o.hecodingsequence.wherein.heconttole.emen^comprisetheln.ronA 

fragment described herein, whereby the coding sequence can be transcribed and translated 
in a host cell. In certain embodiments, the control elements farther comprise a promoter 
selected from the group consisting of an SV40 early promoter, a CMV promoter, a mouse 
mammary tumor virus LTR promoter, an adenovirus major late promoter, an RSV 
promoter, a SRa promoter, and a herpes simplex vinrs promoter. Particularly, the control 
elements may comprise the hCMV immediate-early (IE!) enhancer/promoter reg,on 
found a. nucleotide positions 460 to 12 64ofFigure2,andExon2ofute5--UT R 
comprismg the sequence of nuclides depicted at positions 821-834, inclusive of 
Figure 1 A. Host cells comprising the expression construes and methods of producng a 
recombinant polypeptide are also provided. 

to another embodiment, the invention is directed to a polynucleotide eompnsmg 

the sequence depicted in Figure 5B. 

These and other aspects of the presen, invention will become evident upon 
reference to the following delailed description and attached drawings. In addrtron, 
various references are se, forth herein which describe in more detail certain procedures or 
compositions, and are therefore incorporated by reference in their enttrety. 

pp.f rwraH pfinn of the Drawings 

Figure 1 A (SEQ ID NO:l) shows the sequence of a representative CMV IE1 
Intron A from hCMV strain Towne. Also shown in Figure 1 A is the portion of the 
sequence deleted from deletion mutant pCON3. The splice donor sequence is bolded and 
shown with an arrow. The splice acceptor sequence is underlined and designated wtth an 
0 arrow. Possible branch points are indicated. 
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Figure IB (SEQ ID N0:2) shows the oligonucleotide corresponding to the 
retained 3'-portion of the deleted Intron A construct of P CON3 as compared with the 3'- 

portion of wild-type Intron A. 

Figure 1C (SEQ ID NO:3) shows the Intron A sequence of deletion mutant 

5 pCON3. 

Figure 2 (SEQ ID NO:4) (GenBank accession number M60321 and Chapman et 
al Nuc Acids Res. (1991) 19:3979-3986) shows the nucleotide sequence of the 5' region 
of the major immediate-early gene of hCMV, including the enhancer/promoter region. 
The enhancer region (nucleotides -600 to -1081), the Pol E promoter (nucleotides 1081- 
10 1 143) Exon 1 of the 5' UTR (nucleotides 1144-1264), Intron A (nucleotides 1265-2088) 
and Exon 2 of the 5' UTR (nucleotides 2089-2096) are shown. The TATAA and CAAT 
boxes, as well as the start codon, are boxed. 

Figure 3 shows various Intron A deletion mutants as described in the examples. 
Figure 4 depicts normalized luciferase expression by the various deletion mutants 

15 shown in Figures 1C and Figure 3. 

Figure 5A (SEQ ID NO:5) shows the wild-type rabbit P-globin gene sequence 

used in the examples. 

Figure 5B (SEQ ID NO:6) shows the optimized rabbit p-globin gene sequence 

used in the examples. 

20 Figure 6 shows luciferase expression as a measure of P 55gag expression by parent 

vector pCMVkm-Luciferase, as compared to R P G-IVSI (containing the wild-type rabbit 
p-globin gene sequence shown in Figure 4A) and R P G-OPTI (containing the optmnzed 
rabbit p-globin gene sequence shown in Figure 4B) 

Figure 7 depicts anti-p55gag titers from mice immunized with various constructs 
including the Intron A fragment, as described in the examples. 
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retailed Desr rir f ^" ftf the Invention 

The practice of the present invention will employ, unless otherwise indicated, 
conventional methods of chemistry, biochemistry, recombinant DNA techniques and 
immunology, within the skill of the art. Such techniques are explained fully m the 
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Uterature. See,e,.,Sambro* etal.^C^, 

Edition); Methods ,n Emy^ (S. Colowi* and N. Kaplan ed,, Academe Pre. 
Inc , L Cloning VoU. I and H (D.N. G.over ed.) ; OHonucleoMe Syn. es* (MX 
Gait ed.,; AW* A* (B.D. Ham* * S.J. Higgins eds.); W « 

G*« (R.K. Freshney ed.); Perbal, B., A Practical Guide to Molecular Clontng. 

AU pupations, patent and patent applications cited herein, whether supra or 
infra, are hereby incorporated by reference in their entirety. 

It must be noted that, as used in this specification and the appended claims, the 
singular forms 'V, "an" and "the" include plural referents unless the content clear* 
dJL otherwise. Thus, for example, reference to "an antigen" inc.udes a mixture of 

two or more antigens, and the like. 

The following amino acid abbreviations are used throughout the text: 

Alanine: Ala (A) Arginine: Arg (R) 

Asparagine: Asn (N) Aspartic acid: Asp <D) 

Cysteine: Cys(C) Glutamine: Gin (Q) 

Glutamic acid: Glu (E) Glycine: Gly (G) 

Histidine:His(H) Isoleucine: He (I) 

Leucine: Leu (L) Lysine: Lys (K) 

Methionine: Met (M) Phenylalanine: Phe (F) 

Proline: Pro (P) Serine: Ser (S) 

Threonine: Thr (T) Tryptophan: Trp (W) 

Tyrosine: Tyr(Y) Valine: Val (V) 

t Definitions 

m describing the present invention, die following terms will be employed, and are 
5 intended to be defined as indicated below. 

By "Intron A fragment" is meant a fragment derived from an Intron A sequence of 
a CMV immediate-early enhancer/promoter region, which does not include the entire 
Mron A sequence. A representative hCMV enhancer/promoter region is shown ,n Figure 
2 The Mac, Intron A sequence is represented by the lowercase nucleotides spanning 



7 



PP16095.002 
PATENT 




10 



15 



20 



25 



positions 1265-2088 of Figure 2 and nucleotides 1-820 of Figure 1 A. The Intron A 
fragment of the present invention comprises a deletion from the full-length sequence, 
which deletion may be interna, or occur at the 5'- and/or 3'-ends of the Intron A regron, so 
l0 „g as the region still functions to permit authentic splicing in the nucleus of pnrnary 
^ripts that include the Intron A fragment. Preferably, an "Intron A fragment 
includes the minimum number of bases or elements necessary to achieve exprcsston 
,eve,s over those achieved in corresponding constructs that complete., lack an Inti-on A 
sequence. More preferably, expression levels achieved by constructs that include the 
Intron A fragment of the invention are at leas, two-fold over those levels achteved 
without the presence of the Intron A region, preferably at least ten-fold greater, most 
preferably at least twenty- to fifty-fold greater, or more, than those levels achteved 
without the intron A region. Preferably, expression levels are a, least equal to, or greater 
man for example a. least two-fold greaterthan, those levels achieved when the tntact 

comparisons are typically made by making expression constructs that include all e emen* 
of the test constat, but either completely lack the Intron A sequence, or tnclude the full- 
length Intron A sequence (see the Examples herein). 

Thus an "Intron A fragment" of Represent invention will generally tnclude at 
toas, the 5' splice junction sequence (nucleotides 1-7 as shown in Figure 1A), usually a. 
least up to the fust 25 5'-nucleotides of the mtron A region (nucleotides 1-25 of F.gure 
, A) more preferably at leas, up to the Erst 30 nucleotides of the Intron A regton 
(nucleotides 1-30 of Figure 1 A), even more preferably a, leas, up ,o the first 40 
nucleotides of the Intron A region (nucleotides 1-40 of Figure 1 A), more preferably a 
,eas. up to tire first 51 nucleotides of the Intron A region (nucleotides 1-51 of Figure 1 A), 
and even up to the firs, 75 or more nucleotides of me Intron A region, and any mteger 
between these values, or even more of the 5'-region of Intron A. 

Moreover, in addition to the 5'-sequence described above, an "Intron A fragment 
win optionally include at leas, me 3' splice junction sequence (nucleotides 815-820 of 
Figure 1 A). Generally, me Intron A fragment will include at least up to the 25 3'- 
m c,eo,ides of the Infron A sequence shown in Figure 1 A (nucleotides 796-820 of Figure 
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1 A) preferably up to the 50 3'-nucleot,des of the sequence shown m F.gure 1 A 
(nucleotides 771-820 of Figure 1 A), more generaUy up to rhe 70 .-nuclides 
(nucleotides 75.-820 of Figure 1A ), preferably at teas, up to the 80 3'-nuc,eotides 
(nuclides 741-820 of Figure !A), or even more of the 3'-region, such as the 100-150 
■.nucleotides, and any integer between these values, or more of the J'-regton of Intron A. 

Thus it is apparent that an Intron A fragment according to the present mvenfon 
may include a variety of internal deletions, such as about 10 to about 750 or more 
nucleotides of the Intron A sequence, preferably about 25 to about 700 or more 
nuclides, more preferably about 50 to about 700 nucleotides, and most preferably 
about 500 to about 680-690 or more nucleotides, or any integer between the above 
ranges,so.ongasanexpressionconstruc,includingmeIn W nA fe gmente im er^ 

enhances expression relative to a corresponding construct completely lacking an Intron A 
sequence, or provides equivalent or enhanced expression relative to a correspondmg 
construct which includes the entire Intron A sequence, as described above. 

The retained 5'- and 3* of the Intron A fragment of me present mvenfon 
may be directly linked to one another, e.g., as shown in Figure 1 A, or the «d *■ 
regi „„s of the intron A fragment may be linked together via a linker sequence. The taker 
sequence may comprise from 1 up to about 400 or more nucleotides, or any mteger 
between these values, and may comprise regions for particular transcript factors, such as 
NF, binding sites, tissue-specific enhancer sequences, such as muscle-specific enhancers, 
and the like. 

One representative Intron A fragment sequence comprises the sequence of 
nucleotides at positions 1-51 linked to nucleotides 741-820, of Figure 1A, thus 
comprising an internal deletion of nucleotides 52-740, as shown in Figure 1 A. Also 
induded in this construct is Exon 2 of the 5' UTR of the hCMV enhancer/promoter 
region, nucleotides 821-834 of Figure 1 A. 

An "Intron A fragment" as used herein, encompasses sequences with tdeutity to 
an Intron A fragment isolated from any of the various hCMV strains, such as for example 
»CMV strain Towne and hCMV strain AD169, as well as polynucleotides that are 
0 substantially homologous to the reference molecule (as defined be.ow) and which sttl. 
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function as described above. Thus, for example, the fragment shown in Figure 1C 
includes nucleotide substitutions at the branch points and in the polypyrimidine tract to 
conform these sequences to consensus sequences, as shown in Figures IB and 1C. 
Preferably, but not necessarily, the branch points retain termination codons, i.e., TAA, 
TAG or TGA Moreover, portions of the molecule outside of the splice donor and splice 
acceptor regions are more amenable to change. In this regard, it is preferable to retam the 
5' GT found at the 5' splice junction, and preferably the first six base pairs found at the 5' 
splice junction. It is also preferable to retain the 3' AG found at the 3' splice junction, 
preferably the three base pairs, CAG, found at the 3' splice junction. The nucleotides 
found in these regions are preferably at least 80% homologous to the sequence of 
nucleotides present in the native sequence shown in Figure 1 A, but may be less 
homologous as long as the Intron A fragment retains function, as defined above. Further, 
the polypyrimidine tract region is preferably one where substantially all of the bases are 
TsorCs. 

The terms "polypeptide" and "protein" refer to a polymer of ammo actd residues 
and are not limited to a minimum length of the produet. Thus, peptides, oligopeptides, 
aimers, multimers, and the like, are included within the definition. Both fun-length 
proteins and fragments thereof are encompassed by me definition. The terms also molude 
postexpression modifications of the polypeptide, for example, glycosylate, acetylatton, 

phosphorylation and the like. 

For purposes of the present invention, the polypeptide expressed by the coding 
sequence may be one useful in a vaccine, therapeutic or diagnostic and may be derived 
from any of several known viruses, bacteria, parasites and fungi, as well as any of the 
various tumor antigens. Alternatively, the expressed polypeptide may be a therapeutic 
hormone, a transcription or translation mediator, an enzyme, an intermediate in a 
metabolic pathway, an immunomodulator, and the like. 

Furthermore, for purposes of the present invention, a "polypeptide" refers to a 
protein which includes modifications, such as deletions, additions and substitutions 
(generally conservative in nature), to the native sequence, so long as the protein maintains 
3 the desired activity. These modifications may be deliberate, as through site-directed 
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mutagenesis, or may be serendipitous, such as through mutations of hosts which produce 
the proteins or errors due to PCR amplification. 

A "coding sequence" or a sequence which "encodes" a selected polypeptide, is a 
nucleic acid molecule which is transcribed (in the case of DNA) and translated (in the 
case of mRNA) into a polypeptide in vitro or *, vivo when placed under the control of 
appropriate regulatory sequences. The boundaries of the coding sequence are determmed 
by a start codon at the 5' (amino) terminus and a translation stop codon at the 3' (carboxy) 
terminus. A coding sequence can include, but is not limited to, cDNA from viral, 
procaryotic or eucaryotic mRNA, genomic DNA sequences from viral (e.g. DNA viruses 
and retroviruses) or procaryotic DNA, and synthetic DNA sequences. A transcription 
termination sequence may be located 3' to the coding sequence. 

A "nucleic acid" molecule can include both double- and single-stranded 
sequences and refers to, but is not limited to, cDNA from viral, procaryotic or eucaryotic 
mRNA genomic DNA sequences from viral (e.g. DNA viruses and retroviruses) or 
procaryotic DNA, and especially synthetic DNA sequences. The term also captures 
sequences that include any of the known base analogs of DNA and RNA. 

"Operably linked" refers to an arrangement of elements wherein the components 
so described are configured so as to perform their desired function. Thus, a given 
promoter operably linked to a coding sequence is capable of effecting the expression of 
the coding sequence when the proper transcription factors, etc., are present. The 
promoter need not be contiguous with the coding sequence, so long as it functions to 
direct the expression thereof. Thus, for example, intervening untranslated yet transcribed 
sequences can be present between the promoter sequence and the coding sequence, as can 
transcribed introns, and the promoter sequence can still be considered "operably linked" 

to the coding sequence. 

"Recombinant" as used herein to describe a nucleic acid molecule means a 
polynucleotide of genomic, cDNA, viral, semisynthetic, or synthetic origin which, by 
virtue of its origin or manipulation is not associated with all or a portion of the 
polynucleotide with which it is associated in nature. The term "recombinant" as used 
with respect to a protein or polypeptide means a polypeptide produced by expression of a 
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recombinant polypeptide. In general, d- S- »f interest is Coned and then expressed 
i^ansfonnedorganisms.asdescribedte.herbe.ow.^hostorganism expresses the 

foreign gene to produce the protein under expression condttions. 

A "control element" refers to a polynucleotide sequence which atds m the 
expression of a coding sequence to which it is linked, lb. term includes promoters, 
transcription termination sequences, upstream regulatory domains, po.yadenylation 
signals, untranslatedregions, including S'-UTRs (such as Ex„n2 of thehCMV 
enhancer/promoter region 5'-UTO) and 3'-UTRs and when appropriate, leader sequences 
and enhancers, which collectively provide for the transcription and translation of a codmg 

3 sequence in a host cell. 

A "promoter" as used herein is a DNA regulatory region capable of bmdmg RNA 
polymerase inahostcell and initiating transcription of a downstream (3' direction) 
coding sequence operab.y linked thereto. For purposes of the present invention, a 
promoter sequence includes the minimum number of bases or elements necessary to 
5 initiate transcription of a gene of interest at ,eve,s detectable above background. Wflnn 
the promoter sequence is a transcription initiation site, as well as protein binding domams 
(consensus sequences) responsible for me binding of RNA polymerase. Eucaryotic 
promoters wil, often, but no. always, contain "TATAA" boxes and "CAAT" boxes 

A control sequence "directs the transcription" of a coding sequence in a cell when 
20 RNA polymerase will bind the promoter sequence and transcribe the coding sequence 
into mRNA, which is then translated into the polypeptide encoded by the codmg 
sequence. 

A "host cell" is a cell which has been transformed, or is capable of transformation, 

by an exogenous DNA sequence. 

25 A "heterologous" region of a DNA construct is an identifiable segment of DNA 

within or attached to another DNA molecule that is no. found in association wtth the 
other molecule in nature. For example, a sequence encoding a human protein other than 
the immediate-early 72,000 molecular weight protein of hCMV is considered a 
heterologous sequence when linked to an hCMV IE1 enhancer/promoter. Smularly, a 

30 sequence encoding the immediate-early 72,000 molecular weight protein of hCMV wtll 
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be considered heterologous when linked to an hCMV promoter with which it is not 
normally associated. Another example of a heterologous coding sequence is a construct 
where the coding sequence itself is not found in nature (e.g., synthetic sequences having 
codons different from the native gene). Allelic variation or naturally occurring 
mutational events do not give rise to a heterologous region of DNA, as used herein. 

By "selectable marker" is meant a gene which confers a phenotype on a cell 
expressing the marker, such that the cell can be identified under appropriate conditions. 
Generally, a selectable marker allows selection of transected cells based on their ability to 
thrive in the presence or absence of a chemical or other agent that inhibits an essential 
cell function. Suitable markers, therefore, include genes coding for proteins which confer 
drug resistance or sensitivity thereto, impart color to, or change the antigenic 
characteristics of those cells transfected with a nucleic acid element containing the 
selectable marker when the cells are grown in an appropriate selective medium. For 
example, selectable markers include: cytotoxic markers and drug resistance markers, 
whereby cells are selected by their ability to grow on media containing one or more of the 
cytotoxins or drugs; auxotrophic markers by which cells are selected by their ability to 
grow on defined media with or without particular nutrients or supplements, such as 
thymidine and hypoxanthine; metabolic markers by which cells are selected for, e.g., their 
ability to grow on defined media containing the appropriate sugar as the sole carbon 
source, or markers which confer the ability of cells to form colored colonies on 
chromogenic substrates or cause cells to fluoresce. Representative selectable markers are 

described in more detail below. 

"Expression cassette" or "expression construct" refers to an assembly which is 
capable of directing the expression of the sequence(s) or gene(s) of interest. The 
expression cassette includes control elements, as described above, such as a promoter or 
promoter/enhancer (such as the hCMV IE1 enhancer/promoter) which is operably linked 
to (so as to direct transcription of) the sequence(s) or gene(s) of interest, and often 
includes a polyadenylation sequence as well. An expression cassette will also include an 
Intron A fragment as defined above and, optionally, Exon 2 of the hCMV IE1 
enhancer/promoter region. Within certain embodiments of the invention, the expression 
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cassette described herein may be contained within a plasmid construct. In addition to the 
components of the expression cassette, the plasmid construct may also include, one or 
more selectable markers, a signal which allows the plasmid construct to exist as single- 
stranded DNA (e.g., a M13 origin of replication), at least one multiple cloning site, and a 
"mammalian" origin of replication (e.g., a SV40 or adenovirus origin of replication). 

"Transformation," as used herein, refers to the insertion of an exogenous 
polynucleotide into a host cell, irrespective of the method used for insertion: for 
example, transformation by direct uptake, transfection, infection, and the like. For 
particular methods of transfection, see further below. The exogenous polynucleotide may 
be maintained as a nonintegrated vector, for example, an episome, or alternatively, may 
be integrated into the host genome. 

By "isolated" is meant, when referring to a polypeptide, that the indicated 
molecule is separate and discrete from the whole organism with which the molecule is 
found in nature or is present in the substantial absence of other biological macro- 
molecules of the same type. The term "isolated" with respect to a polynucleotide is a 
nucleic acid molecule devoid, in whole or part, of sequences normally associated with it 
in nature; or a sequence, as it exists in nature, but having heterologous sequences in 
association therewith; or a molecule disassociated from the chromosome. 

"Homology" refers to the percent identity between two polynucleotide or two 
polypeptide moieties. Two DNA, or two polypeptide sequences are "substantially 
homologous" to each other when the sequences exhibit at least about 50% , preferably at 
least about 75%, more preferably at least about 80%-85% (80, 81, 82, 83, 84, 85%), 
preferably at least about 90%, and most preferably at least about 95%-98% (95, 96, 97, 
98%), or more, or any integer within the range of 50% to 100%, sequence identity over a 
defined length of the molecules. As used herein, substantially homologous also refers to 
sequences showing complete identity to the specified DNA or polypeptide sequence. 

In general, "identity" refers to an exact nucleotide-to-nucleotide or amino acid-to- 
amino acid correspondence of two polynucleotides or polypeptide sequences, 
respectively. Percent identity can be determined by a direct comparison of the sequence 
information between two molecules by aligning the sequences, counting the exact number 
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of matches between the two aligned sequences, dividing by the length of the shorter 
sequence, and multiplying the result by 100. Readily available computer programs can 
be used to aid in the analysis, such as ALIGN, Dayhoff, M.O. in Atlas of Protein 
Sequence and Structure M.O. Dayhoff ed., 5 Suppl. 3:353-358, National biomedical 
Research Foundation, Washington, DC, which adapts the local homology algorithm of 
Smith and Waterman Advances in Appl. Math. 2:482-489, 1981 for peptide analysis. 
Programs for determining nucleotide sequence identity are available in the Wisconsin 
Sequence Analysis Package, Version 8 (available from Genetics Computer Group, 
Madison, WI) for example, the BESTFIT, FASTA and GAP programs, which also rely 
on the Smith and Waterman algorithm. These programs are readily utilized with the 
default parameters recommended by the manufacturer and described in the Wisconsin 
Sequence Analysis Package referred to above. For example, percent identity of a 
particular nucleotide sequence to a reference sequence can be determined using the 
homology algorithm of Smith and Waterman with a default scoring table and a gap 
penalty of six nucleotide positions. 

Another method of establishing percent identity in the context of the present 
invention is to use the MPSRCH package of programs copyrighted by the University of 
Edinburgh, developed by John F. Collins and Shane S. Sturrok, and distributed by 
IntelliGenetics, Inc. (Mountain View, CA). From this suite of packages the Smith- 
Waterman algorithm can be employed where default parameters are used for the scoring 
table (for example, gap open penalty of 12, gap extension penalty of one, and a gap of 
six). From the data generated the "Match" value reflects "sequence identity." Other 
suitable programs for calculating the percent identity or similarity between sequences are 
generally known in the art, for example, another alignment program is BLAST, used with 
default parameters. For example, BLASTN and BLASTP can be used using the 
following default parameters: genetic code = standard; filter = none; strand = both; cutoff 
= 60; expect = 10; Matrix = BLOSUM62; Descriptions = 50 sequences; sort by = HIGH 
SCORE; Databases = non-redundant, GenBank + EMBL + DDBJ + PDB + GenBank 
CDS translations + Swiss protein + Spupdate + PIR. Details of these programs can be 
found at the following internet address: http://www.ncbi.nlm.gov/cgi-binmLAST. 
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Alternatively, homology ean be determined by hybridization of polynueleotides 
under conditions which form stable duplexes between homologous regions, Mowed by 
digestion with singie-stianded-specifrc nuclease(s), and size determination of the dtgested 
fragment, DNA sequences that are substantially homologous can be identrfied in a 
Southern hybridization experiment under, for example, stringent conditions, as denned 
for that particular system. Denning appropriate hybridization conditions is wtthm tire 
skill of the art. See, e.g., Sambrook et a.., supra; DNA Clonic supra; Nucleic Acui 
Hybridization, supra. 

n of Carrr ir """he Invention 

Before describing the present invention in detail, it is to be understood that this 
invention is not limited to particular formulations or process parameters as such may, of 
course vary. It is also to be understood that the terminology used herein ,s for the 
purpose of describing particular embodiments of the invention only, and is no. mtended 

to be limiting. 

Although a number of compositions and methods similar or equivalent to those 
described herein can be used in the practice of the present invention, the preferred 
materials and methods are described herein. 

As noted above, the present invention is based on the discovery of novel hCMV 
mtron A fragments which are able to enhance expression of a downstream (3') sequence 
relative to expression leve.s achieved in the absence of an Intron A sequence, or at leas, 
provide for equivalent expression levels as those obtained using the intact, full-length 
tntron A sequence. As explained above, the hCMV IE1 enhancer/promoter from whrch 
the intron A sequence is derived, is one of the strongest enhancer/promoters known and ts 
active in abroad range of cell types. See, e.g., Chapman et al., Nuc. Acids Res. (1991) 
19-3979-3986- and U.S. Patent No. 5,688,688. The use of active fragments from thts 
region effectively reduces the overall plasmid size for expression of a particular codmg 
sequence. This is particutarly desirable when large coding sequences, and/or vrral vectors 
with limited ability to package large genes, are used. Moreover, the decrease in overall 
0 size of the constructs effectively enhances efficiency of expression. Thus, the Intron A 
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fragments of the present invention surprisingly retain the ability to result in expression of 
protein at high levels in vitro and in vivo and, in some eases, provide for lugher 
expression than vectors usingthe entire hCMV 1E1 Intron A sequence. Asshownmthe 
examples, these high levels of expression have provided for immune responses that are 
comparable to, or even better than, that induced by the parent vector. 

As explained above, the totron A fragments for use herein will retain at least up to 
flre initial 7 nucleotides of the Intron A region, preferably at least up to the imttal 25 
nucleotides of the Intron A region (see, Figure 1 A for a representative Intron A 
sequence) In general, the Intron A fragment of the present invention will retam at least 
up to the first 30 nucleotides of the Intron A region (nucleotides 1-30 of Figure 1 A), 
generally a. leas, up to the firs. 40 nucleotides of the Intron A region (nucleotides 1 -40 of 
Figure 1 A), more preferably at leas, up to the first 5 1 nucleotides of the Intron A regton 
(nucleotides 1-51 of Figure 1 A), and even up to the firs. 75 or more nucleotides of the 
intron A region. Thus, me 5'-region may include 25, 26, 27, 28, 29, 30...50, 51, 52, 53, 
54 55 70 71 72 7 3,74,75...85,86,87ormoreoftiie5'-nucleotides,andsoon. It is 
evident tha. any number of nucleotides specified above, as well as nucleotides falling 
within .he specified numbers, are intended to be encompassed herein, so long as an 
expression construct containing the Intron A fragment functions as defined above. 

The Intron A fragment will optionally also include a sufficient amount of the 3'- 
region of Intron A to function as described herein. Generally, then, the Intron A fragment 
will include a. leas, .he 3' splice junction sequence (nucleotides 815-820 of Figure 1A), 
preferably, a. leas, up .o .he 25 3'-nucleotides of the Intron A sequence shown m F.gure 
1A (nucleotides 796-820 of Figure 1 A), preferably up to the 50 ^-nucleotides of the 
sequence shown in Figure 1 A (nucleotides 771-820 of Figure 1 A), more generally up to 
.he 70 3'-nueleotides (nucleotides 751-820 of Figure 1 A), preferably at leas, up .o .he 80 
nucleotides (nucleotides 741-820 of Figure 1A), or even more of the Region, such as 
the 100-150 ^-nucleotides, and any integer between these values, or more of the 3'-regton 
of Intron A. Thus, the 3'-portion of the Intron A fragment may include 50, 51, 52, 53, 54, 
55 70 71 7 2,73,74,75...85,86,87...90,92,93,94,95,96....110,lll,U2,andsoon, 

D or more of the ^-nucleotides of the Intron A region. It is evident mat any number of 
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nucleotides specified above, as well as nucleotides falling within the specified numbers, 
are intended to be encompassed herein. 

The 5'- and 3'-retained regions of the Intron A fragment of the present invention 
may be directly linked to one another, e.g., there may be an internal deletion of the Intron 
A sequence. This deletion may comprise, for example, 10-750 or more base pairs of the 
intact Intron A region, preferably about 300-700 base pairs, and most preferably about 
500-700 base pairs. As shown in Figure 1 A, one preferable fragment includes a large 
internal deletion of about 688 base pairs. This fragment therefore includes the sequence 
of nucleotides at positions 1-51 directly linked to nucleotides 741-834, of Figure 1 A, thus 
) comprising an internal deletion of nucleotides 52-740 of Intron A, as shown in Figure 1 A. 
Nucleotides 821-834 of Figure 1 A represent Exon 2 of the 5'-UTR. Figure 3 shows 
various Intron A fragment constructs with Intron A deletions ranging from 55 to 661 base 
pairs. 

Alternatively, the 5'- and J-regions of the Intron A fragment may be linked 
15 together via a linker sequence. The linker sequence may comprise from 1 up to about 
400 or more nucleotides, preferably from 10-100 nucleotides, or any integer between 
these values, and may comprise regions for enhancers, particular transcript factors, such 
as NFl binding sites, and the like. 

The Intron A fragment of the present invention can be isolated from a CMV 
20 genomic library, as well as from plasmids containing the Intron A region, using an 

appropriate prffobe and cloned for future use. Similarly, the sequence can be produced 
synthetically, using known methods of polynucleotide synthesis (see, e.g. Edge, M.D., 
Nature (1981) 292:756; Nambair, et al. Science (1984) 223:1299; Jay, Ernest, J. Biol. 
Chem. (1984) 259:631 1), based on the known Intron A sequence. See, e.g., Chapman et 
25 al., Nuc. Acids Res. (1991) 19:3979-3986 for the sequence and structure of the Intron A 
region in hCMV strain Towne, and Akrigg et al., Virus Res. (1985) 2:107-121, for a 
description of the corresponding region in hCMV strain AD169; and Figures 1 A and 1C 
herein. 

One particularly convenient method for obtaining the Intron A fragment of the 
30 present invention is to isolate Intron A (either alone, or in association with the rest of the 
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hCMV enhancer/promoter region) from any of the various plasmids known to contain the 
same, using techniques well known in the art, as well as described in the examples herein. 
In particular, hCMV Intron A can be obtained from plasmid pCMV6, as described in 
Chapman et al., Nuc. Acids Res. (1991) 19:3979-3986 and U.S. Patent No. 5,688,688, 
incorporated herein by reference in its entirety. Once obtained, the Intron A sequence can 
be manipulated to obtain deletion mutants thereof, such as by excising portions of the 
Intron A sequence using restriction enzymes. Site specific DNA cleavage is performed 
by treatment with a suitable restriction enzyme (or enzymes), under conditions which are 
generally understood in the art, and the particulars of which are specified by the 
manufacturer of these commercially available enzymes. See, e.g., New England Biolabs, 
Product Catalog. For example, restriction endonucleases with various specificities have 
been isolated from a wide range of prokaryotes and are well known in the art. See, e.g., 
Sambrook et al., supra. The choice of an appropriate restriction endonuclease depends on 
the particular sequence targeted. One of skill in the art will readily recognize the proper 
restriction enzyme to use for a desired sequence. If desired, size separation of the cleaved 
fragments may be performed by polyacrylamide gel or agarose gel electrophoresis, using 
standard techniques. A general description of size separations is found in, e.g., Sambrook 
et al., supra. The Intron A sequence can then be ligated to other control sequences such 
as an' appropriate promoter (if the Intron A is isolated without the remaining hCMV IE1 
enhancer/promoter region), and the desired coding sequence, using known techniques. 

The sequence of the Intron A fragment can be optimized for use in particular 
expression systems using techniques well known in the art. Additionally, portions of the 
sequence of the fragment may be changed, e.g., by deleting or substituting possible 
branch points, as well as other regions of the molecule. These regions of a representative 
Intron A are shown in Figure 1 A. One particular optimized sequence of the Intron A 
fragment is shown in Figure 1C. As explained in the examples, this fragment was 
obtained by first deleting most of the S'-sequence of the Intron A region and then 
substituting, by means of a synthetic oligonucleotide, the last 80 nucleotides of the Intron 
A region with an optimized sequence, and including Exon 2 of the 5'-UTR region. The 
optimized sequence was based on published branch point and polypyrimidine track 
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consensus sequences. Alternatively, mutagenic sequences can be obtained by 
techniques well known in the art, such as site-directed mutagenesis and polymerase cham 
reaction (PCR) techniques where appropriate. See, e.g., Sambrook, supra. 

Once obtained, the fragment can be used to direct the transcription of a desired 
protein in a wide variety of cell types. Cis-acting control elements can be conveniently 
associated with the Intron A fragment in order to optimize expression of the coding 
sequence associated therewith. If proteins produced in the system are either naturally 
secreted or engineered to be, the transformed cells may produce the protein product for 
protracted time periods, further increasing yields. The system allows for the production 
of a desired protein in an authentic configuration, with authentic post-translation 
modifications, in a relatively pure form and in economically useful amounts. 

Thus, the Intron A fragments of the present invention will find use in expression 
constructs to express a wide variety of substances, including peptides which act as 
antibiotics and antiviral agents, e.g., immunogenic peptides for use in vaccines and 
diagnostics; recombinant antibodies; antineoplastics; immunomodulators, such as any of 
the various cytokines including interleukin-1, interleukin-2, interleukin-3, interleukin-4, 
and gamma-interferon; peptide hormones such as insulin, proinsulin, growth hormone, 
GHRH, LHRH, EGF, somatostatin, SNX-111, BMP, insulinotropin, ANP, FSH, LH, PSH 
and hCG, gonadal steroid hormones (androgens, estrogens and progesterone), thyroid- 
stimulating hormone, inhibin, cholecystokinin, ACTH, CRF, dynorphins, endorphins, 
endothelin, fibronectin fragments, galanin, gastrin, insulinotropin, glucagon, GTP- 
binding protein fragments, guanylin, the leukokinins, magainin, mastoparans, 
dermaseptin, systemin, neuromedins, neurotensin, pancreastatin, pancreatic polypeptide, 
substance P, secretin, thymosin, and the like; and growth factors, such as PDGF, EGF, 
,5 KGF, IGF-1 and IGF-2, FGF, and the like. 

' More particularly, proteins for use in vaccines and diagnostics may be of viral, 
bacterial, fungal or parasitic origin, including but not limited to, those encoded by human 
and animal viruses and can correspond to either structural or non-structural proteins. For 
example, the present system will find use for recombinant^ producing a wide variety of 
30 proteins from the herpesvirus family, including proteins derived from herpes simplex 
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virus (HSV) types . and 2, such as HSV-1 and HSV-2 glycoprotcns g B gD and 
proteins derived from varicella zoster virus (VZV), Epstein-Barr virus (EBV) and 
cvtomega.ovirus (CMV) inciuding CMV gB and gH; and proteins derived from outer 
tam anherpesvirusessuchasHHV6andHHV7. (See, e.g. Chee e. a, 

review of the protein coding content of cytomegalovirus; McGeoch et a.., J. Gen. W 
(1988) f2:1531-1574, for a discussion of the various HSV-1 encoded protems; U.S^ 
Patent No. 5,171,568 for a discussion of HSV-1 and HSV-2 g B and gD proteins and the 
genes encoding therefor; Baer et al„ Nature (1984) 3K>:207-2, 1, for the identify of 
protein coding sequences in an EBV genome; and Davison and Scott, J. Gen. Virol. 
(1986) 67:1759-1816, for a review of VZV.) 

Polynucleotide sequences encoding proteins from the hepatitis family of vrruses, 
including hepatitis A vints (HAV), hepatitis B virus (HBV), hepatitis C virus (HCV), the 
de«ta hepatitis virus (HDV), hepatitis E virus (HEV) and hepatitis G vtrus (HGV , can 
also be conveniently used in the techniques described herein. By way of example, the 
viral genomic sequence of HCV is town, as are methods for obtaining the sequence. 
See eg International Publication Nos. WO 89/04669; WO 90/1 1 089; and WO 
90 ,i4436 The HCV genome encodes severa. viral proteins, including E, (also town as 
E) and E2 (also town as E2/NSI). (See, Houghton e. al., Hepatology (1991) 14:381- 
388 f„radiscussionofHCV P ro.eins,inc.udingE.andE2., The sequences encodmg 
each of these proteins, as weil as antigenic fragments thereof, wiU find use rn the present 
system. Similarly, the coding sequence for the 5-antigen from HDV is town (see, e.g., 
U S Patent No. 5,378,8.4) and this sequence can also be convenient* used in the present 
system. Additional antigens derived from HBV, such as the core antigen, the surface 
antigen, sAg, as well as the presurface sequences, preSl and preS2 (formerly called 
preS) as well as combinations of the above, such as sAg/preSl, sAg/preS2, 
sAg/preSl/preS2, and preSl/preS2, will find use herein. See, e.g., "HBV Vaccmes - 
from the .aboratory to license: a case study" in Mackett, M. and Williamson, J.D., Human 
Vaccnes ani Vaccination, pp. 159-176, for a discussion of HBV structure; Beames e. a.., 
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J. Virol ( ,995)69:6833-6838,Birnbau I ne,a>.,J Virol. (1990) 64:3319-3330; and Zhou 

et al., J. Virol. (1991) 65:5457-5464. 

Polynucleotide sequences encoding proteins derived from other viruses wtll also 
find use in the expression systems, such as without limitation, proteins from members of 
the families P.cornaviridae (e.g., poxviruses, etc.); Caliciviridae; Togaviridae (e.g., 
rubella virus, dengue virus, etc.); Flaviviridae; Coronaviridae; Reoviridae; Btrnavrtdae; 
Rhabodoviridae (e.g., rabies virus, etc.); Filoviridae; Paramyxoviridae (e.g., mumps 
virus, measles virus, respiratory syncytial vims, etc.); Orthomyxoviridae (e.g., mfluenza 
virus types A, B and C, etc.); Bunyaviridae; Arenaviridae; Retroviradae (e.g., HTLV-I; 
HTLV-n; HIV-1 (also known as HTLV-III, LAV, ARV, hTLR, etc.)), including but not 
limited to antigens from the isolates HrV inl , HIV,,, H1V LAV , HTV lA „ fflV MN ); HW- 
, HIV-1 • HTV-2; simian immunodeficiency virus (SIV) among others. See, e.g. 
Virology, 3rd Edition (W.K. Joklik ed. 1988); Fundamental Virology, 2nd Edition (B.N. 
Fields and D.M. Knipe, eds. 1991), for a description of these and other viruses. 

For example, the invention may be used in expression constructs to express genes 
encoding the gpl20 envelope protein from any of the above HIV isolates. The gp!20 
sequences for a multitude of HIV-1 and HIV-2 isolates, including members of the vanous 
genetic subtypes of HIV, are known and reported (see, e.g., Myers et al., Los Alamos 
Database, Los Alamos National Laboratory, Los Alamos, New Mexico (1992); Myers et 
al Human Retroviruses and Aids, 1990, Los Alamos, New Mexico: Los Alamos 
National Laboratory; and Modrow et al., J. Virol. (1987) 61 :570-578, for a comparison of 
the envelope gene sequences of a variety of HIV isolates) and sequences denved from 
any of these isolates will find use in the present methods. Furthermore, the invents „ 
equally applicable to other immunogenic proteins derived from any of the vanous HIV 
isolates, including any of the various envelope proteins such as gp!60 and gp41, gag 
antigens such as P 24gag and P 55gag, as well as proteins derived from the pol region. 

The present invention will also find use in expression constructs for the 
expression of influenza virus proteins. Specifically, the envelope glycoproteins HA and 
NA of influenza A are of particular interest for generating an immune response. 
, Numerous HA subtypes of influenza A have been identified (Kawaoka et al., Virology 
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(1990) 179:759-767; Webster et al., "Antigenic variation among type A influenza 
viruses," p. 127-168. In: P. Palese and D.W. Kingsbury (ed.), Genetics of influenza 
viruses. Springer-Verlag, New York). Thus, the gene sequences encoding proteins 
derived from any of these isolates can also be used in the recombinant production 

techniques described herein. 

Furthermore, the fragments described herein provide a means for producing 
proteins useful for treating a variety of malignant cancers. For example, the system of the 
present invention can be used to produce a variety of tumor antigens which in turn may 
be used to mount both humoral and cell-mediated immune responses to particular 
) proteins specific to the cancer in question, such as an activated oncogene, a fetal antigen, 
or an activation marker. Such tumor antigens include any of the various MAGEs 
(melanoma associated antigen E), including MAGE 1, 2, 3, 4, etc. (Boon, T. Scientific 
American (March 1993):82-89); any of the various tyrosinases; MART 1 (melanoma 
antigen recognized by T cells), mutant ras; mutant P 53; P 97 melanoma antigen; CEA 
5 (carcinoembryonic antigen), among others. 

It is readily apparent that the subject invention can be used to produce a variety of 
proteins useful for the prevention, treatment and/or diagnosis of a wide variety of 
diseases. 

Polynucleotide sequences coding for the above-described molecules can be 
20 obtained using recombinant methods, such as by screening cDNA and genomic libraries 
from cells expressing the gene, or by deriving the gene from a vector known to include 
the same. Furthermore, the desired gene can be isolated directly from cells and tissues 
containing the same, using standard techniques, such as phenol extraction and PCR of 
cDNA or genomic DNA. See, e.g., Sambrook et al., supra, for a description of 
25 techniques used to obtain and isolate DNA. The gene of interest can also be produced 
synthetically, rather than cloned. The nucleotide sequence can be designed with the 
appropriate codons for the particular amino acid sequence desired. In general, one will 
select preferred codons for the intended host in which the sequence will be expressed. 
The complete sequence may be assembled from overlapping oligonucleotides prepared by 
30 standard methods and assembled into a complete coding sequence. See, e.g., Edge, 
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Nature (1981) 292:756; Nambair et al., Science (1984) 223:1299; Jay et al., J- Biol. 

Chem. (1984)259:6311. 

Markers and amplifiers can also be employed in the subject expression systems. 
A variety of markers are known which are useful in selecting for transformed cell lines 
and generally comprise a gene whose expression confers a selectable phenotype on 
transformed cells when the cells are grown in an appropriate selective medium. Such 
markers for mammalian cell lines include, for example, the bacterial xanthine-guanine 
phosporibosyl transferase gene, which can be selected for in medium containing 
mycophenolic acid and xanthine (Mulligan et al (1981) Proc. Natl. Acad. Sci. USA 
78-2072-2076), and the aminoglycoside phosphotransferase gene (specifying a protein 
that inactivates the antibacterial action of neomycin/kanamycin derivatives), which can 
be selected for using medium containing neomycin derivatives such as G418 which are 
normally toxic to mammalian cells (Colbere-Garapin et al. (1981) J. Mol. Biol. 150:1- 
14) Useful markers for other expression systems, are well known to those of skill in the 
art These and other selectable markers can be obtained from commercially available 
plasmids, using techniques well known in the art. See, e.g., Sambrook et al., supra. 

Expression can also be amplified by placing an amplifiable gene, such as the 
mouse dihydrofolate reductase (dhfr) gene adjacent to the coding sequence. Cells can 
then be selected for methotrexate resistance in dhfr-deficient cells. See, e.g. Urlaub et al. 
(1980) Proc. Natl. Acad. Sci. USA 77:4216-4220; Ringold et al. (1981) J. Mol. and Appl. 
Genet 1 165-175. Constructs which include both markers and amplifiers will also find 
use in me subject expression vectors, such as any of the various EMCV-DHFR/Neo 
constructs described in, e.g., U.S. Patent No. 6,096,505, incorporated herein by reference 
in its entirety. 

Transcription termination and polyadenylation sequences may also be present, 
located 3' to the translation stop codon for the coding sequence. Examples of 
transcription terminator/polyadenylation signals include, but are not limited to, those 
derived from SV40, as described in Sambrook et al., supra, as well as a bovine growth 
hormone terminator sequence. 
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Also present in the expression constructs of the invention will be a promoter 
region The promoter may be the homologous hCMV IE1 promoter normally associated 
with the intact, full-length Intron A sequence from which the fragment is denved, a 
heterologous CMV IE1 promoter (e.g., from a different CMV strain), or even a non-CMV 
IE1 promoter. The choice of promoter will depend on the cell type used for expression 
and is readily determined by one of skill in the art. For example, typical promoters for 
mammalian cell expression include the SV40 early promoter, a CMV promoter as 
described above, the mouse mammary tumor virus LTR promoter, the adenovirus major 
late promoter (Ad MLP), the RSV promoter, the SRa promoter, the herpes stmplex vtrus 
promoter, tissue-specific promoters, among others. One particular promoter used m the 
constructs described herein is a promoter derived from the hCMV IE1 enhancer/promoter 
region depicted in Figure 2, such as approximately nucleotide positions 460 to 1264 of 
Figure 2, or functional portions of this region. Other nonviral promoters, such as a 
promoter derived from the murine metallothionein gene, will also find use for 
mammalian expression. Insect cell expression systems, typically Baculovirus systems, 
will generally include a polyhedrin promoter. Promoters for use in bacterial systems 
include promoter sequences derived from sugar metabolizing enzymes, such as galactose, 
lactose (lac) (Chang et al., Nature (1977) 198:1056), and maltose, promoter sequences 
derived from biosyrthetic enzymes such as tryptophan (trp) (Goeddel et al., Ik Aais 
Res (1980) 8:4057; Yelverton et al., Nucl. Acids Res. (1981) 2:731; U.S. Patent 
No 4 738,92V, EPO Publication Nos. 036,776 and 121,775), the b-lactamase (Wa) 
promoter system (Weissmann (1981) "The cloning of interferon and other mistakes" in 
Interferon 3 (ed. I. Gresser)), bacteriophage lambda PL (Shimatake et al., Nature (1981) 
292- 128) the T5 promoter (U.S. Patent No. 4,689,406), hybrid promoters such as too, a 
hybrid trp-lac promoter (Amann et al., Gene (1983) 25:167; de Boer et al., Proc. Natl. 
Acad. Set (1983) 80:21). Promoters useful in yeast expression systems mclude, for 
example, promoters from sequences encoding enzymes in the metabolic pathway such as 
alcohol dehydrogenase (ADH) (BPO Publication No. 284,044), enolase, glucokmase, 
g lucose-6-phosphate isomerase, glyceraldehyde-3-phosphate-dehydrogenase (GAP or 
GAPDH), hexokinase, phosphofructokinase, 3-phosphoglycerate mutase, and pyruvate 
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kinase (PyK) (EPO Publication No. 329,203) promoter, Other promoters for use m sueh 
systems include a promoter derived fern the yeast PH05 gene, encodrng acrd 
phosphatase (Myanohara et al., Proo. No,,. Acaa. Sci. USA (1983) 80,1), as well as 
synthetic sueh as a promoter formed by the fusion of UAS sequences of one yeas, 
promoter with the transcription activation region of another yeas, promoter, creating a 
synthetic hybrid promoter. Examples of such hybrid promoters include the ADH 
regulatory sequence linked to the GAP transcription activation region (U.S. Paten. No, 
4 876 197 and 4,880,734), as well as promoters which consist of the regulatory sequences 
of either the ADH2, GAL4, GAL10, or PH05 gene, combined with the transcnphonal 
, activation region of a glycolytic enzyme gene such as GAP or PyK (EPO Publication No. 
164 556) These and outer promoters can be obtained from commercially avarlable 
plasmids, using techniques well known in the art. See, e.g., Sambrook et a.., supra. 

An expression vector is constructed so that the particular coding sequence rs 
iocated in the vector with the Intron A fragment and the appropriate regulatory sequences, 
5 mepositioningandorien^tionofmecodingsequencewimrespecttomeconttol 

sequences being sueh ma, the coding sequence is transcribed under the "control" of the 
control sequences (i.e., RNA polymerase which binds to .he DNA molecule at the 
promoter tnmscribes the coding sequence). Modification of the sequences encodmg the 
molecule of interest may be desirable to achieve this end. For example, in some 
20 may be necessary to modify the sequence so that it can be attached to the promoter and 
other control sequences in the appropriate orientation; i.e., to maintain the reading frame. 
The promoter sequence and other regulatory sequences may be ligated to the eodmg 
sequence prior to insertion into a vector. Alternatively, the coding sequence can be 
cloned directiy into an expression vector which already contains the Intron A fragmen. 

25 and an appropriate restriction site. 

It may also be desirable to produce mutants or analogs of the gene of interest. 
Mutants or analogs of the polypeptide of interest may be prepared by the deletion of a 
portion of the sequence encoding the polypeptide of interest, by insertion of a sequence, 
and/or by substitution of one or more nucleotides within the sequence. Techniques for 

30 modifying nucleotide sequences, such as site-directed mutagenesis, and the like, are well 
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known to those skilled in the art. See, e.g., Sambrook et al., supra; Kunkel, T. A. (1985) 
Proc Natl Acad. Sci. USA (1985) 82:448; Geisselsoder et al. (1987) BioTechniaues 
5:786; Zoller and Smith (1983) Methods Enzymol. 100:468; Dalbie-McFarland et al. 
(1982) Proc. Natl. Acad. Sci USA 79:6409. 

Once the expression constructs axe assembled, they can be used in a wide variety 
of systems, including insect, mammalian, bacterial, viral and yeast expression systems, 
all well known in the art. Nucleic acid molecules comprising nucleotide sequences of 
interest can be stably integrated into a host cell genome or maintained on a stable 
episomal element in a suitable host cell using various gene delivery techniques well 
known in the art. See, e.g., U.S. Patent No. 5,399,346. 

For example, insect cell expression systems, such as baculovirus systems, are 
known to those of skill in the art and described in, e.g., Summers and Smith, Texas 
Agricultural Experiment Station Bulletin No. 7555(1987). Materials and methods for 
baculovirus/insect cell expression systems are commercially available in kit form from, 
inter alia, Invitrogen, San Diego CA ("MaxBac" kit). Similarly, bacterial and 
mammalian cell expression systems are well known in the art and descnbed in, e.g., 
Sambrook et al., supra. Yeast expression systems are also known in the art and described 
in e g Yeast Genetic Engineering (Barr et al., eds., 1989) Butterworths, London. 

A number of appropriate host cells for use with the above systems are also known. 
3 For example, mammalian cell lines are known in the art and include immortalized cell 
lines available from the American Type Culture Collection (ATCC), such as, but not 
limited to, Chinese hamster ovary (CHO) cells, HeLa cells, baby hamster kidney (BHK) 
cells monkey kidney cells (COS), human embryonic kidney cells, human hepatocellular 
carcinoma cells (e.g., Hep G2), Madin-Darby bovine kidney ("MDBK") cells, as well as 
•5 others. Similarly, bacterial hosts such as E. coli, BacUlus subtilis, and Streptococcus 
spp ., will find use with the present expression constructs. Yeast hosts useful in the 
present invention include inter alia, Saccharomyces cerevisiae, Candida albicans, 
Candida maltosa, Hansenula polymorpha, Kluxer omy ces fragilis, Kluyveromyces lactis, 
Pichia guillerimondii, Pichiapastoris, Schizosaccharomyces pombe and Yarrowia 
30 Hpolytica. Insect cells for use with baculovirus expression vectors include, inter alia, 
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Aedes aegypti, Autographa californica, Bombyx mori, Drosophila melanogaster, 
Spodoptera frugiperda, and Trichoplusia ni. 

A wide variety of methods can be used to deliver the expression constructs to 
cells. Such methods include DEAE dextran-mediated transfection, calcium phosphate 
precipitation, polylysine- or polyornithine-mediated transfection, or precipitation using 
other insoluble inorganic salts, such as strontium phosphate, aluminum silicates including 
bentonite and kaolin, chromic oxide, magnesium silicate, talc, and the like. Other useful 
methods of transfection include electroporation, sonoporation, protoplast fusion, 
liposomes, peptoid delivery, or microinjection. See, e.g., Sambrook et al., supra, for a 
discussion of techniques for transforming cells of interest. 

For example, the expression constructs can be packaged in liposomes prior to 
delivery to the cells. Lipid encapsulation is generally accomplished using liposomes 
which are able to stably bind or entrap and retain nucleic acid. The ratio of condensed 
DNA to lipid preparation can vary but will generally be around 1:1 (mg 
DNA:micromoles lipid), or more of lipid. For a review of the use of liposomes as carriers 
for delivery of nucleic acids, see, Hug and Sleight, Biochim. Biophys. Acta. (1991) 
1097:1-17; Straubinger et al., in Methods of Enzymology (1983), Vol. 101, pp. 512-527. 

Liposomal preparations for use with the present invention include cationic 
(positively charged), anionic (negatively charged) and neutral preparations, with cationic 
liposomes particularly preferred. Cationic liposomes are readily available. For example, 
N[l-2,3-dioleyloxy)propyl]-N,N,N-triethylammonium (DOTMA) liposomes are available 
under 'the trademark Lipofectin, from GIBCO BRL, Grand Island, NY. (See, also, 
Feigner et al., Proc. Natl Acad. Sci. USA (1987) 84:7413-7416). Other commercially 
available lipids include transferee (DDAB/DOPE) and DOTAP/DOPE (Boerhinger). 
Other cationic liposomes can be prepared from readily available materials using 
techniques well known in the art. See, e.g., Szoka et al., Proc. Natl. Acad. Sci. USA 
(1978) 75:4194-4198; PCT Publication No. WO 90/1 1092 for a description of the 
synthesis of DOTAP (l,2-bis(oleoyloxy)-3-(trimethylammonio)propane) liposomes. The 
various liposome-nucleic acid complexes are prepared using methods known in the art. 
See, e.g., Straubinger et al., in METHODS OF IMMUNOLOGY (1983), Vol. 101, pp. 
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512-527- Szoka et al., Proc. Natl. Acad. Sci. USA (1978) 75:4194-4198; Papahadjopoulos 
et al Biochim. BiopHys. Acta (1975) 394:483; Wilson et al., Cell (1979) 11:17); Deamer 
and Bangham, Biochim. Biophys. Acta (1976) 443:629; Ostro et al., Biochem. Biophys. 
Res Commun. (1977) 76:836; Fraley et al., Proc. Natl. Acad. Sci. USA (1979) 76:3348); 
Enoch and Strittmatter, Proc. Natl. Acad. Sci. USA (1979) 76:145); Fraley et al., J. Btol. 
Chem. (1980) 255:10431; Szoka and Papahadjopoulos, Proc. Natl. Acad. Sci. USA (1978) 
75:145; and Schaefer-Ridder et al., Science (1982) 215:166. 

The DNA can also be delivered in cochleate lipid compositions similar to those 
described by Papahadjopoulos et al., Biochem. Biophys. Acta. (1975) 394:483-491. See, 
also, U.S. Patent Nos. 4,663,161 and 4,871,488. 

Depending on the expression system and host selected, the molecules are 
produced by growing host cells transformed by an expression vector described above 
under conditions whereby the protein is expressed. The expressed protein is then isolated 
from the host cells and purified. If the expression system secretes the protein mto growth 
media, the product can be purified directly from the media. If it is not secreted, it can be 
isolated from cell lysates. The selection of the appropriate growth conditions and 
recovery methods are within the skill of the art. For example, once expressed, the 
product may be isolated and purified by any number of techniques, well known in the art, 
including: chromatography, e.g., HPLC, affinity chromatography, ion exchange 
chromatography, size-exclusion, etc.; electrophoresis; density gradient centrifugation; 
solvent extraction, or the like. See, e.g., Protein Purification Principles and Practice, 
2nd edition (Robert K. Scopes ed. 1987); and Protein Purification Methods, a Practical 
Approach (E.L.V. Harris and S. Angal, eds. 1990). 

The expression constructs of the present invention may also be used for nucleic 
acid immunization and gene therapy, using standard gene delivery protocols. Methods 
for gene delivery are known in the art. See, e.g., U.S. Patent Nos. 5,399,346, 5,580,859, 
5 589 466, incorporated by reference herein in their entireties. Genes can be delivered 
either directly to the vertebrate subject or, alternatively, delivered ex vivo, to cells derived 
from the subject and the cells reimplanted in the subject. 
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A number of viral based systems have been developed for gene transfer into 
mammalian cell, For example, retrev^^ . 
delivery systems. A selected gene can be inserted into a vector and packaged m retroviral 
panicles using tecbmques known in the art. The recombinant virus can men be .so ated 
and delivered to cells of the subject either ft, vivo or ex vM, A number of re«rov,ral 
systems have been described (U.S. Patent No. 5,219,740; Miller and Rosman, 

(1989) J9S0-990; Miller, A.D., Human Gene Tnerapy (1990) 1>M; 
Scarpa e. al., Virology (1991) 180:849-852; Burns et a,., Proc. Natl Acad. Sc. USA 
(1993) 90-8033-8037; and Boris-Lawrie and Temin, Cur. Opin. Gene,. Develop. (1993) 
3,102-109. Briefly.retroviralgenedeliveryvehidesofthepresentinventionmaybe 
Readily construct* from a wide variety of retroviruses, including for example^ B£ and 
D type retroviruses as well as spumaviruses and lentiviruses such as FIV, HIV, HIV-1, 
HIV-2 and SIV (see RNA Tumor Viruses, Second Edition, Cold Spring Harbor 
Ubora.ory.1985). Such retroviruses may be readily obtained from depositor.es or 
collections such as dte American Type cmrure Co„ec«ion ("ATCC"; 1080, Universtty 
Blvd., Manassas, VA 201 10-2209), or isolated from known sources using commonly 

available techniques. 

A number of adenovirus vectors have also been described. Unlike retrovtruses 
which inte^ate into the host genome, adenoviruses persist exttachromosomally thus 
minimizing the risks associated with insertional mutagenesis (Haj-Ahmad and Graham, J. 
V,ro, (1986) 57:267-274; Bett e« al., J. Virol (1993) ©591 1-5921; Mittereder e. al., 
Human GeneTherapy (1994) 5:717-729; Seth et al., J. Virol. (1994) ~ "^^^J* 
a,., Gene Tnerapy (1994) 1:5,-58; Berkner, K.L. BioTeOniaues (1988) 6:616-629; and 
Rich et al., Human Gene Therapy (1993) 4:461-476). 

Addttionally, various adeno-associated virus (AAV) vector systems have been 
developed for gene delivery. AAV vectors can be readily constructed using techmques 
well known in ore art. See, e.g., U.S. Paten. Nos. 5,173,414 and 5,,39,941 ; Intemafonal 
Publication Nos. WO 92/01070 (published 23 January 1992) and WO 93/03769 
(published 4 March 1993); Lebkowski e. al., Molec. Ceil Biol. (1988) 8:3988-3996; 
0 Vinccn. e. al., Vaccines 90 (1990) (Cold Spring Harbor Laboratory Press); Carter, B.J. 
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Current Opinion in Biotechnology (1992) 3:533-539; Muzyczka, N. Current Topics m 
Microbiol and Immunol. (1992) 158:97-129; Kotin, R.M. Human Gene Therapy (1994) 
5:793-801; Shelling and Smith, Gene Therapy (1994) 1:165-169; and Zhou et aL, J. Exp. 
Med. (1994)179-1867-1875. 

Molecular conjugate vectors, such as the adenovirus chimeric vectors described in 
Michael et aL. Biol. Chem. (1993) 268:6866-6869 and Wagner et al., Proc. Natl. Acad. 
Sci USA (1992) 89:6099-6103, can also be used for gene delivery. 

Members of the Alphavirus genus, such as but not limited to vectors derived from 
the Sindbis, Semliki Forest viruses and VEE, will also find use as viral vectors for 
delivering the gene of interest. For a description of Sinbus-virus derived vectors useful 
forthe practiceofthe instant methods, see, Dubensky et al., Virol. (1996)70:508-519; 
and International Publication Nos. WO 95/07995 and WO 96/17072. 

The expression constructs of the present invention can also be delivered without a 
viral vector. For example, the construct can be delivered directly, or packaged in 
liposomes prior to delivery to the subject or to cells derived therefrom, as descnbed 
above. 

The expression constructs may also be encapsulated, adsorbed to, or associated 
with particulate carriers. Such carriers present multiple copies of a selected molecule to 
the immune system and promote trapping and retention of molecules in local lymph 
nodes The particles can be phagocytosed by macrophages and can enhance antigen 
presentation through cytokine release. Examples of particulate carriers include those 
derived from polymethyl methacrylate polymers, as well as microparticles denved from 
poly(lactides) and poly(lactide-co-glycolides), known as PLG. See, e.g., Jeffery et al., 
Pharm Res. (1993) 10:362-368; and McGee et al., J. Microencap. (1996). 

Furthermore, other particulate systems and polymers can be used for the in vivo or 
ex vivo delivery of the expression constructs. For example, polymers such as polylysine, 
polyarginine, polyomithine, spermine, spermidine, as well as conjugates of these 
molecules, are useful for transferring a nucleic acid of interest. Similarly, DEAE dextran- 
mediated transfection, calcium phosphate precipitation or precipitation using other 
) insoluble inorganic salts, such as strontium phosphate, aluminum silicates including 
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bentonite and kaolin, chromic oxide, magnesium silicate, talc, and the like, will find use 
with the present system. See, e.g., Feigner, P.L., Advanced Drug Delivery Reviews 
(1990) 5:163-187, for a review of delivery systems useful for gene transfer. 

Additionally, biolistic delivery systems employing particulate carriers such as 
gold and tungsten, are especially useful for delivering the expression constructs of the 
present invention. The particles are coated with the construct to be delivered and 
accelerated to high velocity, generally under a reduced atmosphere, using a gun powder 
discharge from a "gene gun." For a description of such techniques, and apparatuses 
useful therefore, see, e.g., U.S. Patent Nos. 4,945,050; 5,036,006; 5,100,792; 5,179,022; 
5,371,015; and 5,478,744. 

p T »cit g nf Strains Usefiil ™ Practicing the Invention 

A deposit of biologically pure cultures of the following strains was made with the 
American Type Culture Collection, 10801 University Boulevard, Manassas, VA. The 
accession number indicated was assigned after successful viability testing, and the 
requisite fees were paid, made under the provisions of the Budapest Treaty on the 
International Recognition of the Deposit of Microorganisms for the Purpose of Patent 
Procedure and the Regulations thereunder (Budapest Treaty). This assures maintenance 
of viable cultures for a period of thirty (30) years from the date of deposit. The 
organisms will be made available by the ATCC under the terms of the Budapest Treaty, 
which assures permanent and unrestricted availability of the progeny to one determined 
by the U.S. Commissioner of Patents and Trademarks to be entitled thereto according to 
35 U.S.C. §122 and the Commissioner's rules pursuant thereto (including 37 C.F.R. §1.12 
with particular reference to 886 OG 638). Upon the granting of a patent, all restrictions 
on the availability to the public of the deposited cultures will be irrevocably removed. 

These deposits are provided merely as convenience to those of skill in the art, and 
are not an admission that a deposit is required under 35 U.S.C. §112. The nucleic acid 
sequences of these genes, as well as the amino acid sequences of the molecules encoded 
thereby, are incorporated herein by reference and are controlling in the event of any 
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conflict with the description herein. A license may be required to make, use, or sell the 
deposited materials, and no such license is hereby granted. 



Plasmid Deposit Date ATCCNo. 

pCON3 September 27, 2000 PTA-2504 



TTT Ex perimental 

Below are examples of specific embodiments for carrying out the present 
invention. The examples are offered for illustrative purposes only, and are not intended 
to limit the scope of the present invention in any way. 

Efforts have been made to ensure accuracy with respect to numbers used (e.g., 
amounts, temperatures, etc.), but some experimental error and deviation should, of 

course, be allowed for. 

Restriction and modifying enzymes, as well as other reagents for DNA 
manipulations were purchased from commercial sources, and used according to the 
manufacturers' directions. In the cloning of DNA fragments, except where noted, all 
DNA manipulations were done according to standard procedures. See, e.g., Sambrook et 
al., supra. 

Example 1 

Portion of Exer tion Construct s Tending Tntron A Fragments 
A series of 13 expression constructs were made which deleted from between 54 
and 688 nucleotides from within the core region of Intron A, bounded by the splice donor 
and branch point sites. The expression constructs were linked to the firefly (Photinus 
pyralis) luciferase gene or to a codon-optimized HIV P 55gag gene (Zur Megede et al., J. 
Virol. (2000) 74:2628-2635.) 



33 



PP1609S.002 A A 
PATENT W 

The initial deletion of Intron A was prepare* by means of substituting a 778 base 
pair m - S* fragment from plasmid pCMVkmLuc (Internationa. Publication No. WO 
98/06437) with a synthetic oligonucleotide (Figure IB) that restored the last 80 
nucleotides of Wron A (with optimized branch point and polypyrimidine tract sequences 
as shown in Figure IB), together with Exon 2 of tire 5'-UTR (nucleotides 82.-834 of 
Figure 1 A). The resulting construct contained a 688 bp deletion from Intron A and ,s 
shown in Figure 1A. The resulting expression plasmid, pCON3, contains toe hCMV 
enhancer/promoter region with a 130 bp Intron A fragment. The fmal sequence of the 
intron in pCON3 is shown in Figure 1C. 

Twelve additional Intron A deletion constructs were made by progressive deletion 
within plasmid pCMW (U.S. Patent No. 6,096,505) in either the 5' -3' direction from 
the unique Nstt site toward the unique Hp* site, or in the 3-5' direction from the Hpal 
site toward the Nsil site (see, Figure 3 and Table 1). Following the restriction enzyme 
digests the plasmids were treated with T4 DNA polymerase and excess dNTPs. 
Resulting blunt-ended vector fragments were gel-purified and self-tigated. As shown » 
Figure 3 these constructs included deletions within the intron ranging from 54 to 663 
base pairs in length. To generate expression vectors carrying the resulting tntron 
modifications, the Mfel-M fragment from the truncation plasmids was substituted mto 
plasmid pCMVkmLuc digested with NM and Sail. Of these constructs, selected ones 
were digested with SaR-Xbal to generate recipient vector fragments for the insertion of 
the codon-optimized HIV P 55gag gene obtained by digestion of plasmtd 
pCMVkm2.GAGmod.SF2 (Zur Megede et al., J. Virol (2000) 74:2628-2635). 
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Example 2 

r . prrH ^ „ u^no™* Coding crwnrr Using mtron A Fragments 
293 (ATTC Accession No. CRL-1573) and RD (ATTC AccessionNo. CCL-136) 
cells were grown in DMEM medium supplemented with fetal calf serum (10% v/v). 
Fourteen hours prior to transfection, 2*10 5 cells/well were seeded into 6 well plates. 
Transient transfection was done using 2 ug of the vector DNA described above, per well 
using 12 us of Fugene 6 (Roche Molecular Biochemicals, Indianapolis, IN) per supplier 
instructions in 6 replicate wells per construct. Forty-eight hours post-transfection, cell 
lysates were analyzed for reporter gene expression. HIV P 55gag expression was 
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evaluated by means of a P 24 antigen ELIS A (Coulter, Miami, FL). Geometric mean 
titers across each plate (construct) were calculated. 

Transient transfection of 293 cells and evaluation of luciferase expression 
indicated that nearly all of these derivatives expressed as well as or better than the parent 
vector pCMVkm-Luciferase containing the full-length Intron A sequence. The 
constructs containing the two largest intron deletions (pCon3, ^pal-Celll) showed the 
greatest enhancement, approximately two-fold greater than the parent vector (F lg ure 4). 

To further evaluate the effect on expression of a smaller intron, the entire 
sequence of Intron A was substituted with the 126 base pair Intron I from the rabbit P - 
globin gene (RPG-IVSI). Figure 5A shows the wild-type rabbit P-globin gene sequence 
used. In vitro analysis of P 55gag expression indicated that the wild-type construct 
expressed up to approximately 1.8 times higher than the parent vector, pCMVkm- 
Luciferase (Figure 6) . The wild-type sequences for the splice donor, branch point and 
polyY tract of RPG-IVSI are suboptimal relative to the consensus sequences for these 
elements Therefore, the construct containing RPG-IVSI was modified such that these 
sequence elements were optimized. Figure 5B shows the optimized rabbit (3-globin gene 
sequence used, termed R^G-OPTI. Analysis of this construct showed approximately 4 
times higher p55gag expression as compared to the parent vector in vitro (Figure 6). 

All 14 modified-intron constructs were analyzed for efficiency of RNA transcript 
splicing by RT-PCR. For RNA transcript analysis, 293 cells were transiently transfected 
and then lysed using RNAstat 60 (Tel-Test B, Inc., Friendswood, TX) to yield total cell 
RNA. Extracted RNA was digested with RQl-Dnase (Promega Corp, Madison, WI) and 
subjected to RT-PCR using the GeneAmp RNA PCR kit (Roche Molecular 
Biochemicals, Indianapolis, IN). PCR spanning the region of the intron was done using 
an upstream primer in exon 1 of the 5' UTR [primer "KBT-162"; seq. 
CGCTGTTTTGACCTCCATA (SEQ ID NO:7)] and a downstream primer from the 

r™wr «KRT 1 63" • sea GTTGAGCAATTCACGTTCAT (SEQ 
luciferase reporter gene [primer KB l - 1 o J , seq. u i 

ID NO:8)]; a control PCR of actin transcripts was also performed for each RNA 
preparation. All of the mutants spliced efficiently, within the sensitivity of the assay, as 
no products of lengths predicted for unspliced messages were detected. 
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Example 3 

xr.ir W. Add Immuniz ation Using the Tntron A Fragment 
In order to test the ability of the Intron A fragments to direct transcription in vivo, 
5 Balb/C mice in groups of 6 animals (Charles River Co., Willmington, MA) were 

immunized once bilaterally in the tibialis anterior muscle with 5 ^g of naked vector DNA 
per injection site (prepared endotoxin-free [Qiagen, Inc., Valencia, CA] and formulated in 
normal saline). Three- and six-week post-immunization bleeds were analyzed by ELISA 
for anti-HIV P 55gag antibody as described in Zur Megede et al., J. Virol. (2000) 74:2628- 
10 2635. 

The constructs evaluated are shown in Figure 7. Variable immunogenicites were 
seen after a single immunization (see, Figure 7). Significantly, the pCON3 vector which 
deleted approximately 85% of Intron A yielded higher geometric mean titers than the 
parent pCMVkm2.GAGmod.SF2 vector (Figure 7). At three weeks post-immunization, 
the titer was approximately twice that of the parent vector though this fell off by six 

weeks post-injection. 

Accordingly, novel hCMV Intron A fragments and methods of using the same 
have been disclosed. From the foregoing, it will be appreciated that, although specific 
embodiments of the invention have been described herein for purposes of illustration, 
20 various modifications may be made without deviating from the spirit and scope of the 
appended claims. 
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